COBHAM'S THEOREM FOR SUBSTITUTIONS 



FABIEN DURAND 

Abstract. The seminal theorem of Cobham has given rise during the last 
40 years to a lot of works around non-standard numeration systems and has 
been extended to many contexts. In this paper, as a result of fifteen years 
of improvements, we obtain a complete and general version for the so-called 
substitutive sequences. 

Let a and be two multiplicatively independent Perron numbers. Then, 
a sequence x £ A N , where A is a finite alphabet, is both (^-substitutive and 
/3-substitutive if and only if x is ultimately periodic. 



1. Introduction 

The seminal theorem of Cobham has given rise during the last 40 years to a lot of 
works around non-standard numeration systems and has been extended to many 
contexts. The original Cobham's theorem is concerned with integer base numeration 
system. In this paper, as a result of fifteen years of improvements, we obtain a 
complete and general version for the so-called substitutive sequences. 
A set E C N is p -recognizable for some p € N \ {0, 1}, if the language consisting of 
the p-ary expansions of the elements in E is recognizable by a finite automaton. It 
is obvious to see that E is recognizable if and only if it is p fc -recognizable. In 1969, 
A. Cobham obtained the following remarkable theorem. 

Cobham's theorem. [Cob 69] Let p,q > 2 be two multiplicatively independent 
integers (i.e., p k ^ q l for all integers k,£ > 0). A set E C N is both p -recognizable 
and q -recognizable if and only if E is a finite union of arithmetic progressions. 

It is interesting to recall what S. Eilenberg wrote in his book [Eil74 : The proof is 
correct, long and hard. It is a challenge to find a more reasonable proof of this fine 
theorem. To this aim G. Hansel proposed a simpler presentation in |Han82] . also one 
can see [Per90] or the dedicated chapter in [AS03] for an expository presentation 
where a mistake is corrected in R W06j . 

In |Cob 72j. Cobham make precise the structure of these p-recognizable sets: they 
are exactly the images by letter-to-letter morphisms of constant-length substitution 
fixed points. He also defined the notion of p-automatic sequences: The n-th term 
of the sequence is a mapping of the last reached state of the automaton when its 
input is the digits of n is some given base p numeration system. Clearly E C N is 
p-recognizable if and only if its characteristic sequence is p-automatic. Automata 
provide a nice and easy description of p-recognizable sets whereas substitutions 
afford an algorithm to produce such sets. From there a lot of other characterizations 
have been given, the first and major being in terms of: 

(1) p-definable sets (see [BHMV94 for a survey); 

(2) p-kernel |Eil74j : 

(3) (when p is prime) algebraic series over ¥ P (X) (Chr79| ICKMR80| . 
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This opened a wide range of further works we briefly describe below. 
The Cobham theorem has been generalized to recognizable subsets of N d by A. L. Se- 
menov in [Se m77] for base p numeration systems (also called later standard numer- 
ation systems) to give the so-called Cobham-Semenov theorem (see BHMV94 for 
a nice survey). Then the efforts were concentrated to simplify Cobham-Semenov's 
theorem and to generalize it to non-standard numeration systems given by linear 
recurrence relations like the Fibonacci one. Alternative and satisfactory proofs (in 
terms of simplification) have been proposed, among them a very interesting logical 
proof due to C. Michaux and R. Villemaire |MV93llMV96] (see also |Bes97[|Muc03] V 
using p-definable sets in the formalism of first order logic into some arithmetic mod- 
els like the Presburger arithmetic (N, +) (see [BesOl] for a survey on these meth- 
ods). Another recent proof makes use of ergodic measures |Dur08j and the fact 
that j p-recogn izable subsets of N d are characterized by multidimensional substitu- 
tions [CG86, Sal87 . This last characterization is an extension of Cobham's result 
of 1972. 

The first result obtained for non-standard numeration systems is due to S. Fabre 
|Fab94] . He considered subsets of N that are both p-recognizable and [/-recognizable 
where U is a non-standard numeration system associated with some restricted class 
of Pisot numbers. Then, V. Bruyere and F. Point |PB97j proved such a result for 
subsets of N d under less restrictive (and more natural) assumptions on the involved 
Pisot numbers. Later A. Bes (BesOOj succeeded in generalizing Cobham-Semenov's 
theorem for subsets of N d recognizable by automata in two non-standard numera- 
tion systems which are associated with the minimal polynomials of multiplicatively 
independent Pisot numbers. Up to now it is the best generalization obtained for 
d > 2. 

Prior to this result, for d — 1 , Cobham's theorem was extended to a much wider class 
of non-standard numeration systems in IDur98cj where no "Pisot conditions" are 
needed, and later in [DR09 to abstract numeration systems (defined in |RigOO| ). 
This last result includes all previously known such result in dimension f. ft is 
important to notice that the proofs in Dur98c , DR09 used substitution fixed points 
while the papers [BesOOl IBHMV941 IMV931 IMV961 IMuc031 IPB97] used a first order 
logic approach. 

With a substitution is associated an integer square with non-negative entries. It is 
well-known (see [HJ90 for instance) that such a matrix has a real eigenvalue a which 
is greater than or equal to the modulus of all other eigenvalues. It is usually called 
the dominant eigenvalue of M. This allows us to define the notion of a-substitutive 
sequences. From the characterization given in |Cob72) it is easy to deduce that the 
characteristic sequences of p- recognizable subsets of N are p-substitutive sequences. 
This suggested to G. Hansel the following result (see |AM95j ) that is the main 
result of this paper. 

Theorem 1. Let a and j3 be two multiplicatively independent Perron numbers. Let 
A be a finite alphabet and x be a sequence of A n . Then, x is both a -substitutive 
and j3 -substitutive if and only if x is ultimately periodic. 

Partial answers have been given in [Fab94l |Dur98bj |Pur02a] with conditions on the 
substitutions. 

Observe that the main result in Dur98c on numeration systems is a consequence 
of the Cobham theorem for primitive substitutions established in Dur98b thanks 
to a result of Fabre in |Fab95] that characterizes the characteristic sequences of 
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recognizable sets of integers in non-standard numeration systems in terms of fixed 
points of substitutions. 

Let us mention further generalizations of Cobham's theorem. 

We could weaken the assumption that a sequence is p and g-automatic assuming 
that a p-automatic and a g-automatic sequence share the same language (the set 
of finite words occurring in the sequence), and, asking if a Cobham theorem type 
result still holds in this context. It holds as proven in |Fag97| . It was generalized to 
primitive substitutions in |Dur98cj . Translated to the framework of dynamical sys- 
tems this result means that if two subshifts, one being generated by a p-automatic 
sequence and the other by a g-automatic one (with p and g multiplicatively in- 
dependent), have a common topological factor then it contains a unique minimal 
(which is also uniquely ergodic) subshift, moreover it is periodic. This point of view 
is developped in |Dur02a| and provides a new proof of Cobham's type theorem. It 
uses the ergodic measures of the subshifts and the values they take on cylinder 
sets. Moreover this way to tackle the problem also works in higher dimension. 
For example, this can be used to get a new proof of Cobham-Semenov's theorem 
(see |Dur08j ) using multidimensional subshifts. Moreover as subsets of N can be 
seen as tilings of Ml (which are self-similar, see [Sol97| for the definition) it is not 
surprising to have generalizations to self-similar tiling dynamical systems [CD08 . 
The notion of recognizable subsets of R or R d , in standard numeration systems, can 
easily be defined. In a series of papers [BB07, BBB08 , BBL09 the authors obtained 
a very nice generalization of Cobham's theorem. In dimension one the same result 
(but in a different setting) has been obtained in [ABlOaj independently. 
The case of the ring of Gaussian integers with the numeration systems ((— a + i) n ) n , 
a e N \ {0}, (see KS75]) has been investigated in |HS03j . They obtained a very 
partial result and faced the problem to prove that , ta ° ^ a - ta ^ b , 1 arc 
rationally independent which seems to be a difficult number theoretic problem. The 
generalization remains open. 

In |AS92j . the authors defined the notion of p-regular sequences. They take values 
in a ring and are defined using the notion of fc-kernel. When the ring is finite they 
are fc-automatic sequences. J. Bell generalized Cobham's theorem to this context 
in [Bel07] . 

G. Christol gave in |Chr79] a famous and very concrete description of the elements 
of ¥ q ((t)) that are algebraic over ¥ q (i) (q — p n with p a prime number); it shows 
that being an algebraic power series is equivalent to the sequence of coefficients 
being p-automatic. In |Ked06j K. Kedlaya generalized this theorem to the so-called 
generalized power series of Hahn (F g ((f®))) in terms of quasi-automatic functions. 
Then, B. Adamczewski and J. Bell AB08] proved an extension of Cobham's the- 
orem to quasi-automatic functions and put it together with Kedlaya's result to 
derive an analogue of the main result in |CKMR80) asserting that a sequence of 
coefficients represents two algebraic power series in distinct characteristics if and 
only if these power series are rational functions. 

For more details on all these developments we refer to |AB10bl IDR10] . 



2. Words, morphisms, substitutions and numeration systems 
In this section we recall classical definitions and notation. 
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2.1. Words and sequences. An alphabet A is a finite set of elements called letters. 
A word over A is an element of the free monoid generated by A, denoted by A*. 
Let x = xqX\ ■ ■ ■ x n -\ (with xi £ A, < i < n — 1) be a word, its length is n 
and is denoted by \x\. The empty word is denoted by e, |e| = 0. The set of non- 
empty words over A is denoted by A + . The elements of A K are called sequences. If 
x — xqXi ■ ■ ■ is a sequence (with Xi £ A, i £ N) and / = [fc, I] an interval of N we 
set Xi = XkXk+i ■ ■ ■ xi and we say that xi is a factor of If /c = 0, we say that Xi 
is a prefix of £. The set of factors of length n of x is written C n (x) and the set of 
factors of x, or the language of a;, is noted C(x). The occurrences in x of a word u 
are the integers i such that = u. If u has an occurrence in x, we also say 
that u appears in x. When a; is a word, we use the same terminology with similar 
definitions. 

The sequence x is ultimately periodic if there exist a word u and a non-empty word 
v such that x = where = vvv ■ ■ ■ . It is periodic if u is the empty word. 
A word u is recurrent in a; if it appears in x infinitely many times. The set of 
recurrent words of x is denoted by C rcc (x). A sequence x is uniformly recurrent if 
every factor u of x appears infinitely often in x and the greatest difference of two 
successive occurrences of u is bounded. 

2.2. Morphisms and matrices. Let A and B be two alphabets. Let r be a 
morphism from A* to B* . Such a map induces by concatenation a morphism from 
A* to B* . When t(A) = B, we say r is a coding. Thus, codings are onto. If t(A) 
is included in B + , it induces by concatenation a map from A N to B N . These two 
maps are also called r. With the morphism r is naturally associated the matrix 
M T = (rrii^icBjGA where rriij is the number of occurrences of i in the word r(j). 
It is well-known that any non-negative square matrix M has a real eigenvalue a 
which is real and greater or equal to the modulus of any other eigenvalue. We call a 
the dominating eigenvalue of M. Moreover a is a Perron number, it is an algebraic 
real number > 1 strictly dominating the modulus of all its algebraic conjugates (see 
for instance [LM95 ) . The matrix M is called primitive if it has a power such that 
all its coefficients are positive. In this case the dominating eigenvalue is unique, 
positive and it is a simple root of the characteristic polynomial. This is Perron's 
Theorem. 

2.3. Substitutions and substitutive sequences. A substitution is a morphism 
r : A* — > A* . If there exist a letter a £ A and a word u £ A + such that cr(a) = au 
and moreover, if lim„_ s ._|_ 00 |a"(a)| = +oo, then a is said to be prolongable on a. 
Since for all n £ N, cr n (a) is a prefix of a n+1 (a) and because |c™(a)| tends to infinity 
with n, the sequence (a n (aaa ■ ■ ■ )) n >o converges (for the usual product topology 
on A N ) to a sequence denoted by cr"(a). The morphism a being continuous for 
the product topology, a u (a) is a fixed point of a: a(cr u (a)) — a u (a). A sequence 
obtained in this way by iterating a prolongable substitution is said to be purely 
substitutive (w.r.t. a). If x £ A N is purely substitutive and if </> : A* — > B* is a 
coding, then the sequence y = <f)(x) is said to be substitutive. 

Whenever the matrix associated with r is primitive we say that r is a primitive 
substitution. We say t is a growing substitution if lim„_^ +00 |r™(6)| = +oo for all 
b £ A. It is erasing if there exists b £ A such that r(6) is the empty word. 
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Definition 2. Let A be a finite alphabet. A sequence x G B N is said to be a- 
substitutive (w.r.t. a) if a : A* — » A* is a substitution prolongable on the letter a 
such that: 

(1) all letters of A have an occurrence in ^(a); 

(2) a is the dominating eigenvalue of the incidence matrix of a; 

(3) there exists a coding <p : A* B* with x = c^(cr w (a)). 

If moreover a is primitive, then </>(cr w (a)) is said to be a primitive a-substitutive 
infinite sequence (w.r.t. a). 

The condition ([1]) is important. Indeed, let consider the substitution r defined by 
r(a) = aaab, r(b) = be and r(c) = cb. It has three fixed points r"(a), T w {b) and 
t u (c). The sequence r w (a) is 3-substitutive and we do not want to say that t"(&) 
and t"(c) are 3-substitutive. With our definition they are 2-substitutive. 

2.4. Growth type and erasures. The following well known result (see Chapter 
III. 7 in [SS78] ) will be very useful in the sequel. 

Proposition 3. Let a : A* — > A* be a substitution. For all a E A, one of the 
following two situations occurs 

(1) 3N G N : Vn > N, \a n (a)\ = 0, or, 

(2) there exist d(a) G N and real numbers c(a),0(a) such that 

urn — - — ~tz r — —— = 1. 

n^+oo c (a) n d (°) 0(a) n 

Moreover, under the situation ([2]), for all i G {0, . . . , d(a)} there exists a letter 
b G A appearing in a J (a) for some j G N satisfying 

\a n (b)\ 
lim ' = 1. 

n^f+oc c(b) n l 9(aj n 

This justifies the following definition. 

Definition 4. Let a : A* — > A* be a non-erasing substitution. For all a G A, the 
couple (d(a),9(a)) defined in Proposition^ is called the growth type of a and 9(a) 
is called the growth rate of a (w.r.t. a). The growth type of a word is the maximal 
growth type of its letters. If (d, 9) and (e, /3) are two growth types we say that (d, 9) 
is less than (e, /3) (or (d, 9) < (e, (3) ) whenever 9 < (3 or, 9 — (3 and d < e. 

We say that a G A is a growing letter if (d(a),9(a)) > (0,1) or equivalently, if 
lim n _j. +00 |er n (a)| = +oo. 

We set 9 := max{6*(a) | a G A}, D := max{ci(a) | a G A, 9(a) = 9} and A max := 
{a G A | 9(a) = 9, d(a) = D}. The dominating eigenvalue of M is 9. Consequently, 
any sequence which is substitutive w.r.t. a is 9-substitutive. We will say that the 
letters of A max are of maximal growth and that (D, 9) is the growth type of a. 
Observe that if 9 = 1, then in view of the last part of Proposition [31 there exists 
at least one non-growing letter (of growth type (0,1)). Otherwise stated, if a 
letter has a polynomial growth, then there exists at least one non-growing letter. 
Consequently a is growing (i.e., all its letters are growing) if and only if 9(a) > 1 
for all a G A. We observe that for all k > 1, the growth type of a k is (D, 9 fc ). 
The following theorem allows us to suppose that the substitutions we deal with are 
non-erasing. We refer to [CN031 Theoreme 4] for the proof of this theorem even if 



6 



FABIEN DURAND 



Property (3) is not stated in this paper but is clear from the proof. Other references 
with other proofs are in |Cob68| IPan831 IAS03| lHon09] . 

Theorem 5. Let a : A* — > A* be a substitution prolongable on the letter a and 
4> : A* — > C* be a morphism such that 4>(a u '{a)) belongs to A N . Then, <j>(tj u (a)) — 
i/j(T w (b)) where: 

(1) ip : B* — > C* is a coding; 

(2) r : B* — > B* is a non-erasing substitution prolongable on b; 

(3) there exist 7 : A — > B* and k such that 

7 o o~ = t o 7 and ^07 = 0. 

From the classical theory of non- negative matrices (see |HJ90j ) we deduce the fol- 
lowing corollary. 

Corollary 6. The image by a non- erasing morphism of an a- substitutive sequence 
is an a k -substitutive sequence with respect to a non-erasing substitution for some 
k. 

Proof. Let a : A* — > A* be a substitution prolongable on the letter a and a the 
dominating eigenvalue of its incidence matrix. Let y = cr" (a) and <j) : A* — > C* be 
a non-erasing morphism. We can suppose all letters of C appear in some words of 
<fi(A). It suffices to show that 4>(y) is a fe -substitutive for some k. 
From Theorem [5] we can suppose <f>(y) — ^(^(b)) where ip and r satisfy (1), (2) 
and (3) of this theorem. It suffices to show that the dominating eigenvalue of the 
incidence matrix M T of r is a k (where k comes from (3) of Theorem [5]). Let M^, 
and M a be the incidence matrices of the corresponding morphisms. Let v be 
an eigenvector of M a for the eigenvalue a. Then, a k M 7 v = M T M 7 v. As <fi is non- 
erasing and M^,M 1 = M^, M 7 v is a non-zero vector. Hence it is an eigenvector of 
M T for the eigenvalue a k . 

Conversely, let /3 be the dominating eigenvalue of M T and wbea corresponding left 
eigenvector. Then, wM 7 M k = f3wM 1 . As i/j is a coding, there exists w' with non- 
negative coordinates such that w = w'M^p. Thus, w' M^M k = f3w' M^. All letters 
of C appearing in some words of 4>{A), w' ' is a non- negative vector different from 
0. This concludes the proof. □ 

Once Theorem [T] will be proven, it will show that, in this corollary, the non-erasing 
assumption cannot be removed. For example, we can consider a defined by <r(a) = 
06, a(b) = bac and o~{c) = ccc, and, the (erasing) morphism (f> defined by (j)(a) = a, 
<p(b) = b and 4>(c) is the empty word. Then, a u (a) is 3-substitutive while (/>(<7 w (a)) 
is a non ultimately periodic 2-substitutive sequence. 

3. Proof of Theorem Q] 

3.1. Already known results used to prove the conjecture. The proofs of 
most of the generalizations of Cobham's theorem are divided into two parts. 

(i) Dealing with a subset X of integers, we have to prove that X is syndetic. 
Equivalently, dealing with an infinite sequence x, we have to prove that the 
letters occurring infinitely many times in x appear with bounded gaps. 

(ii) In the second part of the proof, the ultimate periodicity of X or x has to 
be carried on. 
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The first part is already known in the framework of substitutions. 

Theorem 7. DR09, Theorem 17] Let a, (3 g]1,+oo[ be two multiplicatively inde- 
pendent real numbers. If a sequence x is both a- substitutive and j3 -substitutive then 
the words having infinitely many occurrences in x appear in x with bounded gaps. 

The second part is proved in the context of "good" substitutions in |Dur02bj and 
was proven previously in many others (see [DRlOj for a survey). 

Definition 8. Let a : A* — > A* be a substitution whose dominating eigenvalue is 
a. If there exists B C A such that for all b G B, o~(b) G B* , then the substitution 
t : B* — > B* defined by r(6) = o~(b), for all b G B, is a sub-substitution of a. 
The substitution a is a "good" substitution if it is growing and has a primitive 
sub-substitution whose dominating eigenvalue is a. 

Not all non-erasing substitutions have a primitive sub-substitution, for example 
i-4 010, 1 4 2 and 2 4 1. But all have a power that has at least a primitive 
sub-substitution ([Dur02b ]). But, even up to some power, some are not "good". 
For example the substitution a defined by H > 0100, 1 i-4 12 and 2 i — ^ 21 has 
a unique primitive sub-substitution (r : 1 i— > 12 and 2 4 21) but is not "good" 
because the dominant eigenvalue of a is 3 and it is 2 for r. 

We recall that in the (non-erasing) purely substitutive context the expected exten- 
sion of Cobham's theorem is known. 

Theorem 9. |Dur02b| Corollary 19] Let a : A* -4 A* and r : A* -4 A* be two 

non-erasing growing substitutions prolongable on a G A with respective dominating 
eigenvalues a and (3. Suppose that all letters of A appear in cr"(a) and in t u (a) 
and that a and (3 are multiplicatively independent. If x = o~ u {a) = r w (a), then x is 
ultimately periodic. 

3.2. Concatenation of return words. Let A be a finite alphabet. Let x G A N 
and u G C{x). A return word to u (for x) is a word w such that wu belongs to 
C(x), u is a prefix of wu and u has exactly two occurrences in wu. The set of return 
words is denoted by TZ x (u). A sequence x G A N is linearly recurrent (with constant 
L) if x is uniformly recurrent and for all w G lZ x (u) we have \w\ < L\u\. 

Lemma 10. DHS99, Theorem 24] Let x G A N be a non-periodic linearly recurrent 
sequence (with constant L). Then, 

(1) for all w G 1Z x (u) we have ^ < \w\; 

(2) there exists a constant K such that for all u G C(x) we have #TZ x (u) < K . 

The following lemma will be, in some sense, the last decisive argument to prove 
Theorem [TJ When W is a set of words, W* stands for the set of all concatenations 
of elements of W. 

Lemma 11. Let x be a non-periodic linearly recurrent sequence (with constant L). 
Then, there exists a constant K such that for all u G C{x) and all I G N, 

# U <„</ (TZ x (u)* n C n {x)) < (1 + K)W . 

Proof. Let K be the constant given by Lemma [TD] Let I, n G N with n < I, 
u G C(x) and w G lZ x (u)* n C n (x). As the distance between two occurrences of u 
in w is at least \u\/L, in w there are at most Ll/\u\ occurrences of u. Hence w is 
a concatenation of exactly Ll/\u\ words belonging to TZ x (u) U {e}. This concludes 
the proof. □ 
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The following proposition allows us to apply this lemma to primitive substitutive 
sequences. 

Theorem 12. Dur98a] Theorem 4.5] All primitive substitutive sequences are lin- 
early recurrent. 

Note that it is explained in [Dur96l through the Chacon example that uniformly re- 
current substitutive sequences are primitive substitutive. In [DurlOj it is explained 
the generality of the treatment of the Chacon substitution. The same result has 
been obtained in [DL06] but with a significant longer proof. 

3.3. Reduction of the problem. Let us make two obvious but important re- 
marks. Let a, p > 1. When a sequence is a-substitutive with respect to a substi- 
tution cr, it is also a^-substitutive with respect to a k . Moreover, for all positive 
integers I and k, a and (3 are multiplicatively independent if and only if a 1 and f3 k 
are multiplicatively independent. Hence due to the statement we want to prove, we 
can suppose, if needed, that a has the properties some <r k would have (changing a 
by a k if needed). Hence we can always suppose that a fulfills: 

(1) a is non-erasing (Corollary [6]); 

(2) a is growing or there exists a growing letter a £ A such that er(a) = vau (or 
uav) with {e} where B is the set of non-growing letters ([Pan84 ( 
Theoreme 4.1]); 

(3) a has a primitive sub-substitution (see |Dur02b] ]). 

3.4. Technical lemmata and proof of Theorem [TJ Before to prove the main 
result of this paper we need to establish some lemmata. 

Lemma 13. Let x be a substitutive sequence with respect to a growing substitution. 
If each word in C vec (x) appears with bounded gaps in x then there exists a primitive 
substitutive sequence y such that C vec (x) = C(y). 

Proof. Let a : A* — > A* be a growing substitution prolongable on the letter a 
and (f> : A* — >• B* be a coding such that x = 0((7 w (a)). We can suppose that all 
letters of A occurs in some a™ (a), n € N. Proposition 15 in Dur02aj asserts that 
growing substitutions have at least one primitive sub-substitution. Let r : C* — > C* 
be a primitive sub-substitution of a. It is necessarily a growing substitution and 
it is easy to check that each word cr™(6), b £ C, n £ N, is recurrent in a" (a). 
There exist c £ C and a positive integer I such that c is a prefix of er'(c). Thus 
y = (j> ((c ) (c)) exists, is primitive substitutive and C(y) C C rec (x). As primitive 
substitutive sequences are uniformly recurrent, from the hypothesis we deduce that 
C Iec (x) = C(y). □ 

Lemma 14. Let x be a sequence and u be a word such that: if u — v k then k = 1. 
Suppose that each word in C rec (x) appears in x with bounded gaps and that C rec (x) 
contains {«"; n £ N}. Then, x is ultimately periodic and C rcc (x) — C(uuu ■ ■ ■). 

Proof. Let u be such that: if u = v k then k = 1. Let us reproduce arguments 
already used in |Dur02bl Theorem 18] in order to conclude that x is ultimately 
periodic. As u belongs to C ICC (x), it appears with bounded gaps in x. Thus, the 
set 1Z X (u) of return words to u is finite. There exists an integer N such that all the 
words wu £ C{xnXn+i • • ■ ), w £ lZ x (u), appear infinitely many times in x. Hence 
these words appear with bounded gaps in x. We set t = xn%n+i We can 
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suppose that u is a prefix of t. Then t is a concatenation of return words to u. Let 
w be a return word to u such that wu belongs to C(t). It appears in some power of 
u: wu = su k p where k > 1, s is a suffix of u, with |s| < and p a prefix of u, with 
|p| < Hence p is also a suffix of u and there exists p' such that u = pp' = p'p. If 
p and p' are non-empty then u = v k for some v and k > 2 (see [Lot83, Proposition 
1.3.2]). This is in contradiction with our assumption. Consequently, p or p' is 
empty. Doing the same with s we will deduce that necessarily w = u. It follows 
that t — u u , x is ultimately periodic and £ roc = £(t) = C(uuu ■ ■ •). □ 

We have seen that Theorem [1] was proven in |Dur02b] in the context of "good" 
substitutions. The following lemma, together with Theorem[7j is the main argument 
to treat, in Theorem [TJ the case where one of the two substitutive sequences is not 
"good" . 

Lemma 15. Let x be purely substitutive w.r.t. a : A* — > A* satisfying (|TJ) and @ 
of Section \3.3[ having a dominating eigenvalue a strictly greater than 1 and such 
that all letters of A occurs in x. Suppose there exists a coding <f> : A* — > B* such 
that all words belonging to C ICC ((f>(x)) appear with bounded gaps in <p(x). Then, 
either all letters of A have the same growth rate (w.r.t a) or <j)(x) is ultimately 
periodic. 

Proof. Suppose a is prolongable on a', x = cr LJ (a') and y = 4>{x). Suppose a has 
at least two growth rates and let us show </>(x) is ultimately periodic. The growth 
type of a is (d, a) for some d. 
We consider two cases. 

Suppose a is not a growing substitution. Then, from Assumption (j2J in Section r3.3[ 

there exists a growing letter c£i such that cr(c) = vcu (or ucv ) with u €E B* \ {e} 

where B is the set of non-growing letters. It is convenient to notice that a n (e) — 

a n ~ 1 (v)cua(u) ■ ■ ■ ct" _1 (u) or <j"(c) = a n ~ 1 (u) ■ ■ ■ a(u)uca n ^ 1 (v) and there exist 

two distinct positive integers i and j such that a 1 (u) = a 3 (u) . 

Let us show that c necessarily belongs to C rec (x) whenever x is not ultimately 

periodic. 

Suppose c — a! and o~(c) = vcu. If v is not the empty word then cr(c) = cv'cu and 
c is clearly recurrent in x. If v is the empty word then o~ n (c) = cua(u) ■ ■ ■ a n ^ 1 {u). 
Consequently, because er l (it) = o~ J (u), x is ultimately periodic. Suppose c — a' and 
<t(c) = ucv. Then a' is a non-growing letter. This contradicts our assumptions. 
Suppose c 7^ a'. Then a rn (a') = a 1 set, for some m, and c is clearly recurrent. 
Finally we can consider c is recurrent in x. 

The letter c, and consequently c"(c), n G N, having infinitely many occurrences in 
x, it is also the case for the words u' n J = {a 1 (u)a l+1 (u) ■ ■ ■ a 1 ^ 1 (u)) n , n £ N. Then 

all words in {(0( W «))";neN} appear infinitely many times in <p(x). Lemma ITS 

implies that <j>(x) is ultimately periodic. 

Suppose now that a is a growing substitution. Then, all letters of A have an 
exponential growth rate 7 with 7 > 1. From the assumptions, the numbers 

f3 = max{#(a) | 6(a) < a, a £ A} and / = max{d(a) | 9(a) = (3, a E A} 

exist and (/, /?) is a growth type of some letter of A with 1 < ft < a. 

It necessarily exist two letters occurring infinitely maning times in x, one having 

growth type (/,/?) and the other having an exponential growth rate a. Hence, 
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there exists a non-empty word awb, appearing infinitely many times in x, where 
w has growth type (/,/?), and, a and b have an exponential growth rate a. Once 
we observe that for all letters c with exponential growth rate a there exist letters 
c',c" with exponential growth rate a such that <j(c) = u'c'u = vc"v' where u and 
v (being possibly the empty word) have exponential growth rates strictly less than 
a, then, by a recurrence starting with awb, it is easy to prove that there exist two 
sequences of letters (a„)„ and (b n ) n and two sequences of words (u n ) n and (v n ) n 
such that for all n G N: 

(1) a n u n a(un-i) ■ ■ ■ (J n ~ 1 (ui)a n (w)a n ~ 1 (v\) ■ ■ ■ a(v n -i)v n b n appears infinitely 
many times in x; 

(2) a n has growth type (d n ,a); 

(3) b n has growth type (e n ,a); 

(4) u n has growth type (/ n ,/?n) with (3 n < a and \u n \ < max cSj 4 |<r(c)|; 

(5) v n has growth type (#„,7 n ) with 7„ < q and \v n \ < max cGi |cr(c)|. 

Consequently, for all n and k the word fc) = a k (a n )U(n, k)a k (b n ), where 

U(n, k) = a k (u n ) ■ ■ ■ <7 k+n - 1 (u 1 )a k+n (w)a k+n - 1 (v 1 ) ■ ■ ■ a k (v n ), 

appears infinitely many times in x. There exist a strictly increasing sequence (to,) 
and two letters a and b such that 

(3.1) a mi — a" and b„ H = b" for all i. 

From Lemma IT3l there exists a linearly recurrent sequence z such that 
(PI) C(z) = £ Iec (y). 

If z is periodic, we achieve the proof with Lemma 1141 Hence we suppose z is not 
periodic. 

From the lemmata HU1 and ITT1 there exist some constants K, d e, such that: 

(P2) for all u £ £ TCC (y) and w £ Tly(u) n C t ec{v) w e have ^ < \w\ < K\u\; 
(P3) for all u G C rcc (y) we have #TZy(u) fl C rec (y) < K; 
(P4) for all u G £ rec (y) and all Z G N; 

# u <„<; (72,,(u)* n n £rec(y)) < (1 + K)w . 

(P5) for all n and fc 

|/7(n,fc)| < K {{n + k) f+1 (3 n+k ) , 

^k d a k < \<J k {a")\ < Kk d a k and 
K 

— k e a k < \<J k {b")\ < Kk e a k . 
K 

From (I3.ip and the previous inequalities, there exists k such that |<7 fc (&")| < \a k (a")\ 
(the other case can be treated in the same way), 2(1 + K) < |<r fe (6")| and 

(3.2) \U(rrH, k)\ < \<J k (b")\ for aU 1 < i < (1 + K ) 2{K+3){K+1)K + 1. 

From (PI) and (P2), for all j, all words in C iec (y) fl Cj(y) appear in all words of 
£rec(y) H CrK+i)j(y)- Let u be a prefix of cj)(a k (b")) such that 
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J«l-i<M<M. 

K + 1 - 1 1 - K+ 1 

Then u is non-empty and occurs in 4>(a k (a")). We can decompose 4>(a k (a")) and 
4>{a k {b")) in such a way that 4>(a k (a")) = Q p uQ s and 4>(o- k (b")) = «i? s with 



10.1 < (if + l)|u|. 

Observe that u, a" , b" and Q s do not depend on i. Moreover, for all i belonging 
to [1, (1 + K) 2{K+3){K+1)K + ^ the wQrd u Q s( f,(u( mhk ^ belongs to C rcc (y), is a 
concatenation of return words to u and satisfies 



\uQ s cj>(U( mi ,k))\ < (K + 3)\a k (b") 

Moreover, 



(3-3) 

(K+3)\a k (b")\ , „ 

# |J 7e u (y)*n£„( 2/ )<(l + X) M < (1 + X) 2(K + 3)(K + 1)K_ 

n=0 

But observe that (|[/(mj, k)\) 0<i<1+ ^ 1+K y(K+3)(K+i)K being strictly increasing, the 
words uQ s (j>{U{mi,k)), 0<i< l+(l+K) 2( - K+a ^ K+1 ^ K are all distinct (and belong 
to ^->o<n<(K+3)\a k (b")\'^-u(y)* H C n {y)). This is in contradiction with (I3.3[) . □ 

Proof of Theorem Ql The sufficient condition is proven in |Dur02b| Proposition 7] . 
The necessary condition is proven in |Dur02b[ Theorem 18] in the context of "good" 
substitution. We conclude reducing the problem to better assumptions (Subsection 
13. 3p . then using Theorem [7] and Lemma [T5l (after we observe that substitutions 
having a primitive substitution and that are not "good" necessarily have at least 
two growth rates). □ 

3.5. For morphisms instead of codings. We start with an example. Let a : 
{0, 1}* -> {0, 1}* be defined by er(0) = 01 and ct(1) = 0. The sequence x = er"(0) 
is "^.^ -substitutive. Now let t : {a, 0,1}* ->• {a, 0,1}* be defined by r(a) = aOa, 
cr(0) = aOl and <r(l) = aOa. The sequence y = a" (a) is 3-substitutive. But 
x = (f>(y) where <f> : {a,0, 1}* -> {0, 1}* is defined by ^(0) = 0, (f)(1) = 1 and 0(a) 
is the empty word. Hence a Cobham like theorem does not hold for erasing coding 
morphisms instead of coding. But thanks to Corollary [6] (and Theorem [5]) it holds 
for non-erasing morphisms. 

Theorem 16. Let a > 1 and /3 > 1 be two multiplicatively independent Perron 
numbers, and, (j> : A* — > C* and ip : B* — > C* , defined on finite alphabets, be non- 
erasing morphisms. Suppose x £ A N is a- substitutive and y € B N is j3 -substitutive. 
If 4>{x) = tp(y) then (f>(x) is ultimately periodic. 
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